Three different receiver arrangements are studied, all of which 
incorporate provision for joint adaptive echo cancellation and gain 
adjustment to provide two-wire full-duplex data communication. In 
each case, the canceler consists of a data-driven transversal filter, 
but the architectures differ in the way the gain adjustment is provided. 
For all architectures, we investigate the properties of a joint adaptive 
lms algorithm based on the receiver's decisions on far-end data 
symbols and present appropriate computer simulations. We show 
that an arrangement where the gain control adjusts the reference 
level after the decision detector output performs significantly better 
than agc schemes attempting to adjust the level of the analog signal. 

I. INTRODUCTION 

High-speed full-duplex data communication on a single channel is of 
immense practical interest. Data transmission via the ddd telephone 
network and the possibility of future digital subscriber lines are two of 
the most challenging applications. Techniques for achieving this goal 
fall in essentially three categories: Frequency Division Multiplexing 
(fdm), Time Division Multiplexing (tdm), and echo cancellation. Only 
echo cancellation allows full-bandwidth continuous use of the channel 
in each direction. This scheme therefore offers the highest potential 
bit rates. 

The transmitter and receiver are jointly coupled to a two-wire line 
via a hybrid. In an environment of changing channel characteristics 
(e.g., switched network), the hybrid balancing, if fixed, will at best 
provide a compromise match to the channel. In this mode, a vestige of 
the local transmitted signal, leaking through the hybrid, can be 
expected to interfere with the incoming signal from the far-end 
simultaneously operating transmitter. Figure 1 shows the system under 
discussion, and Fig. 2 models the signals entering and leaving a two-wire 
full-duplex modem. The local transmitter transmits a sequence of data 
symbols {b(n)} at T-second intervals as a pam data waveform. The 
received waveform r(t) consists of a pam data waveform with data 
symbols {a(n)} transmitted also at T-second intervals from the distant 
end, plus noise, plus the interfering vestige of the locally transmitted 
signal. This interfering signal (which we shall refer to as the echo 
signal or echo component) may have power comparable to or even 
greater than that of the desired far-end signal component. 

Fig. 1. Basic system configuration. 

Fig. 2. System model (without canceler). 

Decisions on the {a(n)} are made by quantizing the sampled receiver 
output to ±1 in the case of binary data, or to one of M values in 
the case of M-level data. A typically encountered echo component 
arising in a system with a conventional compromise balanced hybrid 
will cause an unacceptably high error rate. 

To remove the interfering echo component, the local receiver must 
perform echo cancellation; that is, estimate the echo signal and subtract 
it from the incoming signal prior to making decisions, as shown 
in Figs. 1 and 3a. The estimate is a transversally filtered version of the 
local data symbols {b(n)}* as proposed in Ref. 1. If the {b(n)} are 
binary, the implementation is simple, requiring only additions and 
subtractions. The transversal filter tap coefficients {p_m} should 
approximate the samples of the impulse response of the combination of 
the local transmitter and the echo path. 

* The {b(n)} may be different from the user data since they are defined to include 
such operations as differential encoding or scrambling. 

Equivalently (Ref. 1), the tap coefficients should be chosen to minimize, in 
a mean-square sense, the measured receiver error signal, which is the 
difference between the actual receiver output y(n) and the ideal 
output. This error is available at each sampling instant of the received 
data. The subtraction of the (decision-directed) reference {d(n)} is, of 
course, what makes it possible to adapt quickly even in the presence 
of doubletalk. Such adaptation allows tracking time-varying components 
of the echo channel or coping with larger call-to-call variations 
in a switched system. However, since the level of the received signal is 
likely to vary significantly under those conditions, it is essential that 
proper scaling be done when the error signal is computed. Such scaling 
involves a gain adjust device which must operate jointly and adaptively 
with the echo canceler. This paper deals with this joint adaptation 
problem. Although we perform our study for a system operating at 
baseband, our results can be applied to passband structures via 
appropriate redefinitions. 

Fig. 3. Receiver structure with echo canceler and automatic gain control. (a) Overall 
system showing various locations for agc function. (b) Basic agc functions in one of 
the above marked locations A, B, C, or D. 

As shown in Fig. 3a, the error e(n) is the difference between the 
receiver's decision d(n) (assumed correct) and the receiver's output 
y(n). But if the end-to-end channel gain is α, y(n) consists of αa(n) 
plus possibly noise and uncanceled echo. Intersymbol interference is 
not treated in this study. It is expected to be of secondary concern in 
typical subscriber cable systems, but we realize that it cannot be 
neglected for high-speed ddd applications. The error e(n) in the 
absence of gain control would contain the term (α - 1)a(n), which is 
relatively large if α differs significantly from unity. Therefore, a small 
steady-state mean-squared error, with consequent minimal fluctuation 
of the canceler tap coefficients {p_n} and low error rate, is possible only 
if the nonunity gain α is compensated for by an agc adjusted to 
provide a gain approximating 1/α. The gain of the agc, denoted w, 
will be considered to be adjusted jointly with the echo canceler tap 
coefficients to minimize the mean-squared error. 

Shown in Fig. 3a are four possible locations, A, B, C, and D, for 
placement of the agc whose functional form is depicted in Fig. 3b. 
Since D is identical to B as long as the signal is binary and the 
quantizer is ideal, only three variations need to be examined. The 
corresponding three receiver architectures, labeled naturally A, B, and 
C, may differ significantly in their adaption speeds. The main theme 
of this paper is the convergence of each of the three receiver arrange- 
ments, and how it is influenced by channel parameters, such as the 
relative powers of echo and far-end components. 

The study concludes that arrangement C which features an auto- 
matic reference control (arc) at the quantizer output offers the fastest 
convergence rate, together with the simplest implementation. Arrange- 
ments A and B may suffer slower rates of convergence due to coupling 
interaction between the agc and echo canceler tap coefficient adap- 
tation. However, for arrangement B, a simple step-size modification is 
proposed, which on the average decouples the agc and echo canceler 
adaptations, thus improving its convergence rate. Readers interested 
in C, the "best" arrangement, may skip Sections III to V, which deal 
with A and B. 

Mention of earlier related work is in order at this point. Adaptive 
echo cancellation without a jointly adapting agc or equalizer for two- 
wire full duplex data communication has been treated in Refs. 1 to 4. 
References 1 and 4 treat essentially the echo-cancellation system 
proposed here, assuming the end-to-end channel gain α is known, and thus 
omit the agc. This type of scheme, in which the echo canceler's input 
consists only of local data symbols, offers obvious simplicity of 
implementation. In Ref. 2, a 
voice-type canceler is investigated, and Ref. 3 discusses an external 
data-driven structure which cancels the entire waveform by computing 
compensation samples at a high enough rate.* Reference 5 summarizes 
work reported in Refs. 1 and 3, and discusses a further receiver arrangement 
comprising an adaptive echo canceler and adaptive equalizer for 
mitigating end-to-end linear distortion. Arrangement A considered 
here, the so-called "convex canceler," is a special case of this, since an 
agc can be considered a one-tap linear equalizer. Reference 6 describes 
an echo-cancelling receiver structure incorporating decision feedback 
equalization. Finally, our arrangement C combined with decision feed- 
back equalization has recently been proposed in Ref. 7. 

II. SYSTEM MODELING 

We shall examine the convergence properties of several adaptation 
strategies for the three receiver arrangements. Each follows a decision- 
directed approach; agc and echo canceler parameters are adjusted 
once per symbol interval, based on the observed error between the 
unquantized receiver output and the decision d(n) (for arrangements 
A and B) or wd(n) (for arrangement C). For purposes of analysis, the 
decisions d(n) are assumed equal to the actual data symbols a(n). The 
convergence of decision-directed adaptive receivers making small ad- 
justments at each iteration has been found to be negligibly affected by 
occasional decision errors. 

The analysis is based on a simple linear model of the end-to-end 
channel and leakage paths: the signal r(t) entering the receiver from 
the hybrid will be written as 

$$ r(t) = \sum_n a(n)\, g_A(t - nT) + \sum_n b(n)\, g_B(t - nT) + v(t). \tag{1} $$

The first summation represents the signal from the far end, and g_A(t) 
is the end-to-end channel impulse response, including receiver front-end 
filtering. The second summation is the echo signal, and g_B(t) is 
the impulse response of the echo path. The symbol v(t) is a waveform 
of additive white noise. The symbol interval T is equal in both 
summations. This is tantamount to assuming that the near- and far-end 
transmitters are synchronized in clock frequency (Ref. 1). 

Decisions d(n) are made by quantizing samples of the receiver's 
output to ±1 in the case of binary data. The end-to-end channel 
impulse response and the phase of the receiver's sampling clock are 
assumed ideal, so that no intersymbol interference is present in the 
samples of r(t); i.e., the nth sample is of the form 

$$ r(nT) = a(n)\,\alpha + \sum_k b(k)\, \beta_{n-k} + v(nT), \tag{2} $$

where α = g_A(0), g_A(nT) = 0 for n ≠ 0, and β_n ≡ g_B(nT). In subsequent 
notation, we write r(nT) and v(nT) as r(n) and v(n), respectively. The 
binary (±1) data symbols {a(n)} and {b(n)} are statistically independent. 
The noise samples v(n) are assumed to be independent with 
zero mean and variance σ². 

* In Ref. 2, the canceler input is the sampled transmitted waveform. In Ref. 3, it is 
the {b(n)}. 
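As a minimal sketch of the sampled model (2), the following generates received samples r(n) for binary data. It assumes, beyond what the text states, that the echo taps occupy delays 1 through N, that symbols before the start of the record are zero, and that the noise is Gaussian. The function name `received_samples` and its arguments are hypothetical.

```python
import random


def received_samples(alpha, beta, sigma, num_symbols, seed=0):
    """Return (a, b, r) with r(n) = alpha*a(n) + sum_k beta_k*b(n-k) + v(n)."""
    rng = random.Random(seed)
    a = [rng.choice((-1, 1)) for _ in range(num_symbols)]  # far-end symbols
    b = [rng.choice((-1, 1)) for _ in range(num_symbols)]  # local symbols
    r = []
    for n in range(num_symbols):
        # Echo: beta[k-1] weights b(n-k); symbols before n = 0 count as zero.
        echo = sum(beta[k - 1] * b[n - k]
                   for k in range(1, len(beta) + 1) if n - k >= 0)
        noise = rng.gauss(0.0, sigma)  # Gaussian noise is an assumption
        r.append(alpha * a[n] + echo + noise)
    return a, b, r
```

With sigma set to zero, each sample reduces to the far-end term plus the echo term, which makes the two contributions easy to inspect separately.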

III. ARRANGEMENT A 

We recall from Fig. 3a that arrangement A forms the receiver output 
as the sum of the echo canceler and agc outputs. The receiver output 
is a linear function of the echo canceler tap coefficients and agc gain. 
Thus the mean-squared error at the receiver's output is a convex 
quadratic function of the receiver parameters, and a simple gradient 
algorithm can be used with confidence to adjust the parameters jointly. 
As mentioned before, if the agc is replaced by an adaptive linear 
equalizer, the arrangement A generalizes to the jointly adaptive echo 
canceler equalizer structure discussed in Ref. 5. 

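Given an agc gain w and a set of N echo canceler tap coefficients 
{p_k}, k = 1, ..., N, the receiver's unquantized output sample y(n) is 

$$ y(n) = w\, r(n) + \sum_{k=1}^{N} p_k\, b(n - k). \tag{3} $$

Define the N-dimensional vectors 

$$ \mathbf{p} = \begin{bmatrix} p_1 \\ \vdots \\ p_N \end{bmatrix} \quad \text{and} \quad \mathbf{b}(n) = \begin{bmatrix} b(n-1) \\ \vdots \\ b(n-N) \end{bmatrix} $$

and the (N + 1)-dimensional vectors in partitioned form as 

$$ \mathbf{c} = \begin{bmatrix} w \\ \mathbf{p} \end{bmatrix} \quad \text{and} \quad \mathbf{z}(n) = \begin{bmatrix} r(n) \\ \mathbf{b}(n) \end{bmatrix}. $$

Then (3) is written more compactly as 

$$ y(n) = \mathbf{c}^{\dagger} \mathbf{z}(n), \tag{4} $$

where † denotes transpose. The vector c is the set of receiver parameters 
to be adaptively adjusted, and z(n) is the current set of inputs 
stored by the receiver. 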
The ideal output at time nT would be a(n), and the error is 



1598 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1979 



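$$ e(n) = y(n) - a(n). \tag{5} $$

The expression for the mean-squared error is 

$$ \langle e(n)^2 \rangle = \mathbf{c}^{\dagger} A\, \mathbf{c} - 2\, \mathbf{c}^{\dagger} \mathbf{x} + 1, \tag{6} $$

where A is an (N + 1) by (N + 1) covariance matrix 

$$ A = \langle \mathbf{z}(n)\, \mathbf{z}(n)^{\dagger} \rangle \tag{7} $$

and x is an (N + 1)-dimensional vector 

$$ \mathbf{x} = \langle a(n)\, \mathbf{z}(n) \rangle. \tag{8} $$

By rewriting (6) as 

$$ \langle e(n)^2 \rangle = (\mathbf{c} - A^{-1}\mathbf{x})^{\dagger} A\, (\mathbf{c} - A^{-1}\mathbf{x}) + 1 - \mathbf{x}^{\dagger} A^{-1} \mathbf{x}, \tag{9} $$

and recognizing that A by definition is positive semidefinite, it is clear 
that the mean-squared error has its minimum value 

$$ \langle e(n)^2 \rangle_{\min} = 1 - \mathbf{x}^{\dagger} A^{-1} \mathbf{x} \tag{10} $$

when 

$$ \mathbf{c} = \mathbf{c}_{\text{opt}} = A^{-1} \mathbf{x}. \tag{11} $$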

Using the independence assumptions for the data symbols and noise, 
and the expression (2) for r(n), we readily find that A can be written 
as 

$$ A = \begin{bmatrix} A_{00} & \boldsymbol{\beta}^{\dagger} \\ \boldsymbol{\beta} & I \end{bmatrix}, \tag{12} $$

where 

$$ A_{00} = \langle r(n)^2 \rangle = \alpha^2 + \sum_k \beta_k^2 + \sigma^2, \tag{13} $$

$$ \boldsymbol{\beta} = \begin{bmatrix} \beta_1 \\ \vdots \\ \beta_N \end{bmatrix} \tag{14} $$

denotes the sampled echo impulse response truncated to N samples, 
and I is the N-dimensional identity matrix. Note that, because of the 
truncation involved in defining β, |β|² ≤ Σ_k β_k². Similarly, the vector 
x is 

$$ \mathbf{x} = \begin{bmatrix} \alpha \\ \mathbf{0} \end{bmatrix}, \tag{15} $$

where 0 is an N-dimensional all-zero vector. 
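As a worked step, the partitioned forms (12) and (15) allow the optimum (11) to be written out explicitly. The sketch below introduces the shorthand δ² = σ² + Σ_k β_k² - |β|² for the power of the noise plus any uncancellable echo; this shorthand is an assumption of the example at this point in the text, with α the end-to-end gain and σ² the noise variance.

```latex
\begin{aligned}
A\,\mathbf{c} = \mathbf{x}
 \;&\Longleftrightarrow\;
 A_{00}\, w + \boldsymbol{\beta}^{\dagger}\mathbf{p} = \alpha,
 \qquad \boldsymbol{\beta}\, w + \mathbf{p} = \mathbf{0} \\
 &\Longrightarrow\;
 \mathbf{p}_{\text{opt}} = -\,w_{\text{opt}}\,\boldsymbol{\beta},
 \qquad
 w_{\text{opt}} = \frac{\alpha}{A_{00} - |\boldsymbol{\beta}|^{2}}
               = \frac{\alpha}{\alpha^{2} + \delta^{2}}, \\
 \langle e(n)^2\rangle_{\min} &= 1 - \mathbf{x}^{\dagger}A^{-1}\mathbf{x}
               = 1 - \alpha\, w_{\text{opt}}
               = \frac{\delta^{2}}{\alpha^{2} + \delta^{2}} .
\end{aligned}
```

When δ is small, the optimum agc gain is close to 1/α and the optimum canceler taps are close to -β/α, so the minimum mean-squared error is close to zero.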

Adaptive adjustment of the receiver parameter vector c can be 
accomplished by employing the Widrow-Hoff lms algorithm (Ref. 8), just as 
in adaptive mean-square equalization (Ref. 9). The current value of c at time 
nT, c(n), is then updated according to 

$$ \mathbf{c}(n + 1) = \mathbf{c}(n) - \gamma\, e(n)\, \mathbf{z}(n), \tag{16} $$

where γ is a constant step size. The average value of the correction 
term -γe(n)z(n) is proportional to the negative of the gradient of the 
mean-squared error with respect to c. Expression (16) portrays the 
joint updating of the agc gain and echo canceler taps: 

$$ w(n + 1) = w(n) - \gamma\, e(n)\, r(n), \tag{17a} $$

$$ \mathbf{p}(n + 1) = \mathbf{p}(n) - \gamma\, e(n)\, \mathbf{b}(n). \tag{17b} $$

Adaptation of the echo canceler alone, according to (17b) with w 
fixed at 1, has been analyzed by Mueller (Ref. 1). The rate of convergence of 
p(n) to its optimum value for an optimum choice of γ was shown to be 
determined only by the number of taps, rather than by the detailed 
characteristics of the echo path. 
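The joint update (17a) and (17b) can be sketched in a few lines. This is an illustrative simulation under stated assumptions, not the authors' program: decisions are taken equal to the true far-end symbols (as in the analysis), the noise is Gaussian, all parameters start at zero, and the echo taps occupy delays 1 through N. The function name `lms_arrangement_a` is hypothetical.

```python
import random


def lms_arrangement_a(alpha, beta, sigma, gamma, num_symbols, seed=0):
    """Joint agc/canceler lms for arrangement A; returns (w, p, squared errors)."""
    rng = random.Random(seed)
    n_taps = len(beta)
    w = 0.0                   # agc gain, started at zero
    p = [0.0] * n_taps        # canceler taps, started at zero
    b_hist = [0] * n_taps     # b(n-1), ..., b(n-N); zero before start-up
    sq_err = []
    for _ in range(num_symbols):
        a_n = rng.choice((-1, 1))
        echo = sum(bk * hk for bk, hk in zip(beta, b_hist))
        r_n = alpha * a_n + echo + rng.gauss(0.0, sigma)
        y_n = w * r_n + sum(pk * hk for pk, hk in zip(p, b_hist))  # eq. (3)
        e_n = y_n - a_n       # eq. (5), with d(n) = a(n) assumed
        w -= gamma * e_n * r_n                                     # eq. (17a)
        p = [pk - gamma * e_n * hk for pk, hk in zip(p, b_hist)]   # eq. (17b)
        sq_err.append(e_n * e_n)
        b_hist = [rng.choice((-1, 1))] + b_hist[:-1]  # shift in next local symbol
    return w, p, sq_err
```

For a strong far-end signal (α near 1) and low noise, the gain settles near 1/α and the taps near the negated, scaled echo response; averaging `sq_err` over a sliding window gives a learning curve of the kind discussed in the simulation section.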

The joint updating algorithm (16) resembles equalizer adaptation 
algorithms, whose convergence behavior has been extensively studied 
(Refs. 7-16). The convergence of the more general version of (16), for joint 
echo cancellation and equalization, was discussed in Ref. 5. These 
theoretical studies have rested on an untrue assumption of independence 
of successive equalizer or canceler contents. However, a more 
rigorous analysis in Ref. 16, in addition to experimental results, suggests 
that the independence assumption does not cause serious error. 
The aforementioned studies have revealed that, for a fixed step size 
coefficient γ, the speed of convergence is largely governed by the 
spread of the eigenvalues of the matrix A defined by (12). Without 
elaborating on the details, we can say that a ratio of maximum-to-minimum 
eigenvalues which is close to unity leads to relatively fast 
convergence, while slow convergence is associated with a 
maximum-to-minimum eigenvalue ratio which is much greater than unity. If the 
step size coefficient γ is chosen to effect a judicious compromise 
between speed of convergence and noise due to random tap fluctuations, 
then a system with a small eigenvalue spread would typically 
converge in a number of iterations equal to a small multiple of the 
number of adjustable tap coefficients. 

The N + 1 eigenvalues of matrix A defined by (12), (13), and (14) 
are readily found to consist of N - 1 unit eigenvalues plus λ_max and 
λ_min, given by 

$$ \lambda_{\max},\ \lambda_{\min} = \tfrac{1}{2} \Big[ 1 + \alpha^2 + \sigma^2 + \sum_k \beta_k^2 \pm \sqrt{ \Big( 1 + \alpha^2 + \sigma^2 + \sum_k \beta_k^2 \Big)^2 - 4 \Big( \alpha^2 + \sigma^2 + \sum_k \beta_k^2 - |\boldsymbol{\beta}|^2 \Big) } \, \Big], \tag{18} $$

where the plus is associated with λ_max and the minus with λ_min. It is 
straightforward to show that 

$$ \lambda_{\min} < 1 < \lambda_{\max}. $$

It is interesting to point out that expression (18) for λ_max and λ_min 
coincides with the expressions for bounds on the maximum and minimum 
eigenvalues found in Ref. 5 for the more general canceler/equalizer 
combination. Table I shows values of λ_max and λ_min for various 
values of (α² + σ²) (the power of the far-end signal component plus 
noise) and Σ_k β_k² (the power of the echo component), assuming Σ_k β_k² 
= |β|², so that perfect echo cancellation is possible. 

The ratio λ_max/λ_min increases rapidly as α² + σ² decreases. Thus a 
system with a far-end signal component which is much weaker than 
the near-end echo component would be expected to converge much 
more slowly than one with a relatively strong far-end component. This 
sensitivity of the convergence behavior to the relative strengths of 
far-end and near-end signal components was also noted in Ref. 5 for the 
canceler/equalizer system. 
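Expression (18) is easy to evaluate directly. The short sketch below computes the two non-unit eigenvalues and their ratio from the far-end-plus-noise power and the echo power; the function name `eigen_spread` and the optional `tap_energy` argument (the truncated energy |β|², defaulting to the full echo power so that perfect cancellation is possible) are hypothetical.

```python
import math


def eigen_spread(far_end_plus_noise, echo_power, tap_energy=None):
    """Return (lambda_min, lambda_max, ratio) from eq. (18)."""
    if tap_energy is None:
        tap_energy = echo_power      # perfect cancellation possible
    a00 = far_end_plus_noise + echo_power          # eq. (13)
    trace = 1.0 + a00
    root = math.sqrt(trace * trace - 4.0 * (a00 - tap_energy))
    lam_max = 0.5 * (trace + root)
    lam_min = 0.5 * (trace - root)
    return lam_min, lam_max, lam_max / lam_min


if __name__ == "__main__":
    for power in (0.35, 1.0, 2.0):
        print(power, eigen_spread(power, 1.0))
```

For equal far-end and echo powers of 1.0, the function gives eigenvalues of about 0.382 and 2.618, a ratio of about 6.85.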

IV. ARRANGEMENT B 

In this arrangement, combining the received signal and echo canceler 
output is done ahead of the agc. This may appear as a more natural 
arrangement, since the intent is to cancel the echo component at the 
sampling instants before they enter the portion of the receiver devoted 
to estimating a(n). The local data symbols are processed by the echo 
canceler and agc in tandem; the output is not a linear function of the 
canceler tap coefficients and agc gain, and the mean-squared error is 
not a convex function of these receiver parameters. We can thus call 
this second arrangement a "nonconvex canceler." 

Table I 

Far-end signal power   Echo power   Eigenvalues            Ratio 
plus noise power 
α² + σ²                |β|²         λ_min      λ_max       λ_max/λ_min 

0.1                    1.0          0.0487     2.015       41.4 
0.35                   1.0          0.1598     2.190       13.7 
0.50                   1.0          0.2187     2.281       10.4 
1.0                    1.0          0.3820     2.618       6.85 
2.0                    1.0          0.5858     3.414       5.83 
1.0                    2.0          0.2680     3.732       13.9 

The receiver output is 

$$ y(n) = w \big[ r(n) + \mathbf{p}^{\dagger} \mathbf{b}(n) \big]. \tag{19} $$

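With r(n) modeled as in eq. (2), the mean-squared error for given 
values of w and p is 

$$ \langle e(n)^2 \rangle = \langle (y(n) - a(n))^2 \rangle = w^2 \Big[ \alpha^2 + \sum_k \beta_k^2 + \sigma^2 + |\mathbf{p}|^2 + 2\, \mathbf{p}^{\dagger} \boldsymbol{\beta} \Big] - 2 w \alpha + 1, \tag{20} $$

which can be written as 

$$ \langle e(n)^2 \rangle = (w\alpha - 1)^2 + |\mathbf{p} + \boldsymbol{\beta}|^2 w^2 + w^2 \Big[ \sigma^2 + \sum_k \beta_k^2 - |\boldsymbol{\beta}|^2 \Big]. \tag{21} $$

The final term in brackets represents the effect of additive noise and 
uncancellable echo, if any. Expression (21) can also be written as 

$$ \langle e(n)^2 \rangle = (\alpha^2 + \delta^2)(w - w_{\text{opt}})^2 + w^2 |\mathbf{p} - \mathbf{p}_{\text{opt}}|^2 + \frac{\delta^2}{\alpha^2 + \delta^2}, \tag{22} $$

where 

$$ \delta^2 = \sigma^2 + \sum_k \beta_k^2 - |\boldsymbol{\beta}|^2 \tag{23a} $$

and 

$$ w_{\text{opt}} = \frac{\alpha}{\alpha^2 + \delta^2}, \tag{23b} $$

$$ \mathbf{p}_{\text{opt}} = -\boldsymbol{\beta} \tag{23c} $$

are the parameter values which minimize ⟨e(n)²⟩. In practical systems, 
δ is small and w_opt ≈ 1/α. 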
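The three forms (20), (21), and (22) of the mean-squared error can be checked numerically against one another. The sketch below is only such a consistency check; the function names are hypothetical, `sigma2` stands for the noise variance σ², and `echo_power` stands for the full echo power Σ_k β_k², which may exceed the truncated energy |β|².

```python
def mse_eq20(w, p, alpha, beta, sigma2, echo_power):
    """Mean-squared error of arrangement B in the expanded form (20)."""
    p_sq = sum(x * x for x in p)
    p_dot_beta = sum(x * y for x, y in zip(p, beta))
    bracket = alpha ** 2 + echo_power + sigma2 + p_sq + 2.0 * p_dot_beta
    return w * w * bracket - 2.0 * w * alpha + 1.0


def mse_eq21(w, p, alpha, beta, sigma2, echo_power):
    """Same quantity grouped as in (21)."""
    beta_sq = sum(y * y for y in beta)
    tap_err_sq = sum((x + y) ** 2 for x, y in zip(p, beta))   # |p + beta|^2
    return ((w * alpha - 1.0) ** 2 + tap_err_sq * w * w
            + w * w * (sigma2 + echo_power - beta_sq))


def mse_eq22(w, p, alpha, beta, sigma2, echo_power):
    """Same quantity written about the optimum, as in (22) and (23)."""
    beta_sq = sum(y * y for y in beta)
    delta2 = sigma2 + echo_power - beta_sq                    # (23a)
    w_opt = alpha / (alpha ** 2 + delta2)                     # (23b)
    tap_err_sq = sum((x + y) ** 2 for x, y in zip(p, beta))   # p_opt = -beta
    return ((alpha ** 2 + delta2) * (w - w_opt) ** 2 + w * w * tap_err_sq
            + delta2 / (alpha ** 2 + delta2))
```

Because the tap error in (22) is weighted by w², the surface flattens as w approaches zero, which is the origin of the trough discussed in the contour plots.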

It is instructive to plot contours of constant mean-squared error in 
the plane whose coordinates are the agc error 

$$ e_w = w - w_{\text{opt}} \tag{24} $$

and canceler error magnitude 

$$ |\mathbf{e}_p| = |\mathbf{p} - \mathbf{p}_{\text{opt}}|, \tag{25} $$

respectively; i.e., curves satisfying 

$$ \alpha^2 e_w^2 + \Big( e_w + \frac{1}{\alpha} \Big)^2 |\mathbf{e}_p|^2 = \text{mse} \tag{26} $$

(assuming δ is negligible), for various positive values of mse. Such 
contour plots are shown in Figs. 4 and 5 for values of the end-to-end 
gain α of 0.5 and 1. Each plot shows 10 contours for mse ranging from 
0.2 to 2.0 in steps of 0.2. In each case, there is a unique minimum 
(e_w = |e_p| = 0), but the mse surface is not convex. Moreover, it exhibits 
a kind of trough running along the line e_w = -1/α. The floor of this 
trough slopes very gradually toward the origin for large values of |e_p|. 
The shape of the contours for this system differs radically from the 
ellipsoidal contours that are characteristic of the convex system. Those 
contours are plotted in Figs. 6 and 7 for the same values of mse and of 
α and |β| = 1, as for the nonconvex system. The ellipses are plotted 
for convenience for error values mapped onto the two eigenvectors μ_1 
and μ_2 of the matrix A. The eccentricity of the ellipse is the ratio 
λ_max/λ_min. Similarly, the convex system would converge slowly, starting from 
zero-valued parameters if α is small (implying λ_max/λ_min ≫ 1). 

Fig. 4. Contours of constant mean-squared error for arrangement B (α = 0.5). 

Fig. 5. Contours of constant mean-squared error for arrangement B (α = 1.0). 

A gradient procedure (lms algorithm) for jointly adjusting w and p 
should, with proper choice of step size, converge to the optimum 
parameters. It would, on the average, follow a path perpendicular to 
the contours that it crosses. Thus very slow convergence of the 
nonconvex system would be expected if the initial value of all parameters 
is zero; i.e., starting at e_w = -(1/α), |e_p| = |β|. 

An lms algorithm for the nonconvex system is obtained by making 
the correction terms proportional to the negative gradients of the 
squared error. The resulting joint AGC/canceler updating algorithm 
would then be 

$$ w(n + 1) = w(n) - \gamma\, e(n) \big( r(n) + \mathbf{b}(n)^{\dagger} \mathbf{p}(n) \big), \tag{27a} $$

$$ \mathbf{p}(n + 1) = \mathbf{p}(n) - \gamma\, e(n)\, w(n)\, \mathbf{b}(n), \tag{27b} $$

where 

$$ e(n) = w(n) \big( r(n) + \mathbf{b}(n)^{\dagger} \mathbf{p}(n) \big) - a(n). \tag{27c} $$

Fig. 6. Contours of constant mean-squared error for arrangement A (α = 0.5, |β| = 1). Axis label: agc error. 

Fig. 7. Contours of constant mean-squared error for arrangement A (α = 1, |β| = 1). Axis label: agc error. 

V. REDUCING COUPLING EFFECTS ON THE NONCONVEX SYSTEM'S 
ADAPTATION 

Combining (27b) and (27c) results in the following algorithm for 
updating p(n): 

$$ \mathbf{p}(n + 1) = \mathbf{p}(n) - \gamma\, w(n)^2\, \mathbf{b}(n) \big[ r(n) + \mathbf{b}(n)^{\dagger} \mathbf{p}(n) \big] + \gamma\, w(n)\, a(n)\, \mathbf{b}(n). \tag{28} $$

It would be desirable to reduce or eliminate the coupling effect of w(n) 
in this algorithm. Consider modifying the algorithm by replacing the 
ADAPTIVE ECHO CANCELLATION/ AGC STRUCTURES 1605 



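constant step size γ with γ/w(n)², where the second γ is a constant. 
Then (28) becomes 

$$ \mathbf{p}(n + 1) = \mathbf{p}(n) - \gamma\, \mathbf{b}(n) \big[ r(n) + \mathbf{b}(n)^{\dagger} \mathbf{p}(n) \big] + \frac{\gamma}{w(n)}\, a(n)\, \mathbf{b}(n). \tag{29} $$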

The average value of the correction term for fixed p(n) is then 
-γ[β + p(n)], which is the same as the average correction term for 
an adaptive echo canceler with no agc. Thus by the simple choice of 
step size γ/w(n)², we can on the average remove the influence of agc 
adaptation on echo canceler adaptation. The only remaining coupling 
stems from the term [γ/w(n)]a(n)b(n), whose mean value is zero. The 
mean of the correction term in (27a) for updating the agc gain is 

$$ \begin{aligned} -\gamma \big\langle e(n) \big( r(n) + \mathbf{b}(n)^{\dagger} \mathbf{p}(n) \big) \big\rangle = {} & -\gamma \alpha \big( w(n) \alpha - 1 \big) - \gamma\, w(n)\, |\mathbf{p}(n) + \boldsymbol{\beta}|^2 \\ & - \gamma\, w(n) \Big( \sigma^2 + \sum_k \beta_k^2 - |\boldsymbol{\beta}|^2 \Big), \end{aligned} \tag{30} $$

in which coupling from the echo canceler adaptation is evident in the 
middle term γw(n)|p(n) + β|². There is no simple way to eliminate 
this coupling, apart from observing that p(n) + β is the error in the 
echo canceler's tap coefficients, which eventually dies away. 

In summary, we propose the following algorithm for jointly updating 
the agc gain w(n) and the echo canceler tap coefficient vector p(n): 

$$ w(n + 1) = w(n) - \gamma_1\, e(n) \big( r(n) + \mathbf{b}(n)^{\dagger} \mathbf{p}(n) \big), \tag{31a} $$

$$ \mathbf{p}(n + 1) = \mathbf{p}(n) - \frac{\gamma_2}{w(n)}\, e(n)\, \mathbf{b}(n), \tag{31b} $$

where γ_1 and γ_2 may be different constants. Algorithm (31b) is, on the 
average, uncoupled from (31a) and equivalent to the echo canceler 
algorithm operating in solitude with step size γ_2. The latter algorithm 
has been examined by Mueller (Ref. 1), who determined an optimum step size 
γ equal to the reciprocal of the number of taps, and demonstrated 
favorable convergence characteristics, independent of echo path 
characteristics. The choice of γ_1 would best be made by experiment. While 
the above decoupling modification was only heuristically motivated, 
the simulations reported in Section VI confirm its usefulness. 
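The decoupled updates (31a) and (31b) can be sketched as follows. This is an illustration under stated assumptions rather than the simulation program used in the paper: decisions equal the true far-end symbols, the noise is Gaussian, the echo taps occupy delays 1 through N, and the agc gain starts at a nonzero value `w0` because (31b) divides by w(n). The function name `lms_arrangement_b` and its arguments are hypothetical.

```python
import random


def lms_arrangement_b(alpha, beta, sigma, gamma1, gamma2, num_symbols,
                      w0=1.0, seed=0):
    """Decoupled agc/canceler lms for arrangement B; returns (w, p, sq errors)."""
    rng = random.Random(seed)
    n_taps = len(beta)
    w = w0                    # must be nonzero: (31b) divides by w(n)
    p = [0.0] * n_taps
    b_hist = [0] * n_taps     # b(n-1), ..., b(n-N); zero before start-up
    sq_err = []
    for _ in range(num_symbols):
        a_n = rng.choice((-1, 1))
        echo = sum(bk * hk for bk, hk in zip(beta, b_hist))
        r_n = alpha * a_n + echo + rng.gauss(0.0, sigma)
        u_n = r_n + sum(pk * hk for pk, hk in zip(p, b_hist))  # agc input
        e_n = w * u_n - a_n                                    # eq. (27c)
        w_next = w - gamma1 * e_n * u_n                        # eq. (31a)
        # eq. (31b): step size gamma2 / w(n), using the gain before its update
        p = [pk - (gamma2 / w) * e_n * hk for pk, hk in zip(p, b_hist)]
        w = w_next
        sq_err.append(e_n * e_n)
        b_hist = [rng.choice((-1, 1))] + b_hist[:-1]
    return w, p, sq_err
```

A practical implementation would presumably guard against w(n) approaching zero; that safeguard is omitted here to keep the sketch close to (31).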

VI. ARRANGEMENT C 

Arrangement C, shown in Figs. 3a and 8a, simply omits the agc for 
purposes of making a decision on the binary symbol a(n); the quantizer 
input, which is the algebraic sum of the channel output sample and 
the echo canceler output, is hardlimited to ±1. Note that, if the {a(n)} 
are binary symmetric (a(n) = ±1) data symbols, then the attenuation 
of the end-to-end channel is irrelevant and no explicit agc is necessary 
for making a decision. This comment also applies to other baseband 
data symbol formats such as diphase, and also to phase-modulated 
signals, in which case the data symbols a(n) are numbers lying on the 
unit circle in the complex plane. 

To enable adaptive adjustment of the echo canceler in arrangement 
C, the receiver's decision d(n) is scaled by an adjustable coefficient w 
before being subtracted from the unquantized output to form the error 
which is used to update the tap coefficients. The coefficient w thus has 
the role of an automatic reference control (arc), rather than an agc, 
since it adjusts the receiver's reference signal to a level commensurate 
with the attenuation of the end-to-end channel. It is adjusted jointly 
with the echo canceler tap coefficients. Moreover, it multiplies a 
discrete-valued data symbol, not a continuous-valued channel output, 
and so digital implementation of the receiver is simplified. 

This receiver arrangement can also be applied to multiamplitude 
data formats as shown in Fig. 8b. In the multiamplitude case, the 
quantizer compares the analog signal with reference levels which 
require proper scaling in relation to its amplitude. The quantity w 
provides this information and can thus directly serve as a reference 
input to the quantizer as shown in Fig. 8b. A less attractive alternative 
would be to multiply the input signal by 1/w to bring it to a constant 
level (as with the agc schemes) and then apply it to a quantizer with 
a fixed reference. 

Fig. 8. Decision-making and automatic reference control in arrangement C. (a) Case 
of binary data symbols. (b) Case of multilevel data symbols. 

Given a set of N echo canceler tap coefficients {p_k}, k = 1, ..., N, the 
receiver's output sample y(n), which will subsequently be quantized to 
form the output data symbol d(n), is 

$$ y(n) = r(n) + \sum_{k=1}^{N} p_k\, b(n - k). \tag{32} $$

Ideally, the coefficients {p_k} should approximate the negatives of 
the echo channel's impulse response samples {β_k}, k = 1, ..., N, so that the 
echo is canceled and y(n) consists of αa(n) plus noise. 

The desired value of y(n) is wa(n), where w is an estimate of the 
end-to-end channel gain α. The error between the actual and desired 
receiver outputs, measured at the nth symbol interval, is then 

$$ e(n) = r(n) + \sum_{k=1}^{N} p_k\, b(n - k) - w\, a(n). \tag{33} $$

The arc parameter w implicitly assumes the role of an agc, multiplying 
the desired receiver output data symbol rather than the receiver 
input.* The parameters {p_k} and w are to be jointly adjusted with the 
aim of minimizing the mean-squared value of e(n). 

* In arrangements A and B, the ideal value of w equals 1/α; in arrangement C, this 
value equals α. 

For notational compactness, define the N-dimensional vectors 

$$ \mathbf{p} \equiv [p_1, p_2, \ldots, p_N]^{\dagger} \tag{34a} $$

and 

$$ \mathbf{b}(n) \equiv [b(n - 1), b(n - 2), \ldots, b(n - N)]^{\dagger}, \tag{34b} $$

and the (N + 1)-dimensional partitioned vectors 

$$ \mathbf{c} \equiv [-\mathbf{p} \;\vdots\; w]^{\dagger} \tag{35a} $$

and 

$$ \mathbf{u}(n) \equiv [\mathbf{b}(n) \;\vdots\; a(n)]^{\dagger}. \tag{35b} $$

Then the unquantized receiver output is written 

$$ y(n) = r(n) + \mathbf{p}^{\dagger} \mathbf{b}(n), \tag{36} $$

and the error is 

$$ e(n) = r(n) - \mathbf{c}^{\dagger} \mathbf{u}(n). \tag{37} $$

Squaring both sides of (37) and taking the expectation, we find that 
the mean-squared error is 
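$$ \langle e(n)^2 \rangle = \sigma^2 + |\boldsymbol{\epsilon}|^2, \tag{38} $$

where ε = c - s and 

$$ \mathbf{s} \equiv \langle r(n)\, \mathbf{u}(n) \rangle = \begin{bmatrix} \boldsymbol{\beta} \\ \alpha \end{bmatrix}. \tag{39} $$

The minimum mean-squared error is σ², and the excess mean-squared 
error at time nT is defined as |ε(n)|², where 

$$ \boldsymbol{\epsilon}(n) = \mathbf{c}(n) - \mathbf{s} \tag{40} $$

is the difference between the tap coefficient vector c(n) at time nT 
and its optimum value s. 

A simple gradient algorithm for updating c(n) is 

$$ \mathbf{c}(n + 1) = \mathbf{c}(n) + \gamma\, e(n)\, \mathbf{u}(n), \tag{41} $$

where γ is a constant. (Note from expression (37) that e(n)u(n) is 
proportional to the gradient of e(n)² with respect to c.) We shall 
examine the convergence of the excess mean-squared error ⟨|ε(n)|²⟩, 
as in earlier studies, assuming that successive input vectors u(n) are 
statistically independent. Subtracting s from both sides of (41), we 
have 

$$ \boldsymbol{\epsilon}(n + 1) = \boldsymbol{\epsilon}(n) + \gamma\, e(n)\, \mathbf{u}(n). \tag{42} $$

Since the N + 1 components of u(n) are ±1, 

$$ |\mathbf{u}(n)|^2 = N + 1. \tag{43} $$

Also the average of the inner product 

$$ \begin{aligned} \langle \boldsymbol{\epsilon}(n)^{\dagger} \mathbf{u}(n)\, e(n) \rangle &= \langle \boldsymbol{\epsilon}(n)^{\dagger} \mathbf{u}(n) \big( r(n) - \mathbf{u}(n)^{\dagger} \mathbf{c}(n) \big) \rangle \\ &= \langle \boldsymbol{\epsilon}(n)^{\dagger} \big( \mathbf{s} - \mathbf{c}(n) \big) \rangle \\ &= -\langle |\boldsymbol{\epsilon}(n)|^2 \rangle, \end{aligned} \tag{44} $$

where we have used the assumption that u(n) is independent of 
u(n - 1), and therefore of c(n). Using (38), (43), and (44), after squaring 
and averaging both sides of (42) we get an equation describing the 
evolution of the excess mean-squared error 

$$ \langle |\boldsymbol{\epsilon}(n + 1)|^2 \rangle = \langle |\boldsymbol{\epsilon}(n)|^2 \rangle \big[ 1 - 2\gamma + \gamma^2 (N + 1) \big] + \gamma^2 (N + 1) \sigma^2. \tag{45} $$

This expression is readily iterated to yield 

$$ \langle |\boldsymbol{\epsilon}(n)|^2 \rangle = \big[ 1 - 2\gamma + \gamma^2 (N + 1) \big]^n \langle |\boldsymbol{\epsilon}(0)|^2 \rangle + \frac{ \gamma (N + 1) \sigma^2 \big[ 1 - \big( 1 - 2\gamma + \gamma^2 (N + 1) \big)^n \big] }{ 2 - \gamma (N + 1) }. \tag{46} $$

To guarantee convergence, the constant γ must be such that 

$$ 0 < \gamma < \frac{2}{N + 1}. $$

Then the first term in (46) is a transient, which eventually decays to 
zero, and the steady-state value is 

$$ \lim_{n \to \infty} \langle |\boldsymbol{\epsilon}(n)|^2 \rangle = \frac{ \gamma (N + 1) \sigma^2 }{ 2 - \gamma (N + 1) }. \tag{47} $$

The steady-state total mean-squared error is this plus σ², or 

$$ \lim_{n \to \infty} \langle e(n)^2 \rangle = \frac{ 2 \sigma^2 }{ 2 - \gamma (N + 1) }. \tag{48} $$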



Expressions (46) and (47), describing the evolution of the excess 
mean-squared error of arrangement C, are of the same form as those 
derived in Ref. 1 for the echo canceler alone, with no agc. The only 
difference is that (N + 1) replaces N (as a result of the extra agc 
coefficient w). Note that the convergence rate of ⟨e(n)²⟩ is independent 
of all channel parameters, in contrast to the dependence of the 
convergence rates of arrangements A and B on the relative echo and 
far-end signal powers. 

Fig. 9. Reduction of mean-squared error. Arrangement A: α = 1, γ = 0.01, 16 taps. 

Fig. 10. Reduction of mean-squared error. Arrangement A: α = 0.316, γ = 0.01, 16 taps. 
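The arrangement C adaptation can be sketched by taking a gradient step on e(n)² from (33) with respect to the canceler taps and the reference coefficient w separately. This is an illustrative simulation under stated assumptions: decisions equal the true far-end symbols, the noise is Gaussian, all parameters start at zero, and the echo taps occupy delays 1 through N. The function name `lms_arrangement_c` is hypothetical.

```python
import random


def lms_arrangement_c(alpha, beta, sigma, gamma, num_symbols, seed=0):
    """Joint canceler/arc lms for arrangement C; returns (w, p, sq errors)."""
    rng = random.Random(seed)
    n_taps = len(beta)
    w = 0.0                   # reference level (arc coefficient)
    p = [0.0] * n_taps        # canceler taps
    b_hist = [0] * n_taps     # b(n-1), ..., b(n-N); zero before start-up
    sq_err = []
    for _ in range(num_symbols):
        a_n = rng.choice((-1, 1))
        echo = sum(bk * hk for bk, hk in zip(beta, b_hist))
        r_n = alpha * a_n + echo + rng.gauss(0.0, sigma)
        y_n = r_n + sum(pk * hk for pk, hk in zip(p, b_hist))   # eq. (32)
        e_n = y_n - w * a_n          # eq. (33), with d(n) = a(n) assumed
        # Gradient step on e(n)^2: de/dp = b(n), de/dw = -a(n)
        p = [pk - gamma * e_n * hk for pk, hk in zip(p, b_hist)]
        w += gamma * e_n * a_n
        sq_err.append(e_n * e_n)
        b_hist = [rng.choice((-1, 1))] + b_hist[:-1]
    return w, p, sq_err
```

Running the sketch with a weak far-end signal (for example α = 0.316) and again with α = 1 gives a way to observe the claimed insensitivity of the convergence rate to the channel gain; w settles near α and the taps near the negated echo response.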

The choice of the adaptation coefficient γ reflects a compromise 
between fast adaptation and small steady-state mean-squared error. A 
suitable choice, proposed in Refs. 1 and 11, is 

$$ \gamma = \frac{1}{N + 1}, $$

which yields a steady-state mean-squared error of twice the minimum 
mean-squared error σ². With this choice of γ substituted in (46), we 
find that the excess mean-squared error decreases toward its minimum 
value (47) at a rate of 4.34/(N + 1) dB per adjustment. 
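The recursion (45), its closed form (46), and the quoted decay rate can be compared numerically. The sketch below is a check of the arithmetic only; the function names are hypothetical, `sigma2` stands for the noise variance σ², and `eps0` stands for the initial excess mean-squared error.

```python
import math


def contraction_factor(gamma, n_taps):
    """Per-step factor 1 - 2*gamma + gamma^2*(N+1) appearing in (45)."""
    return 1.0 - 2.0 * gamma + gamma * gamma * (n_taps + 1)


def excess_mse_recursion(eps0, gamma, n_taps, sigma2, steps):
    """Iterate (45); returns the excess mse for n = 0, ..., steps."""
    rho = contraction_factor(gamma, n_taps)
    values = [eps0]
    for _ in range(steps):
        values.append(rho * values[-1] + gamma * gamma * (n_taps + 1) * sigma2)
    return values


def excess_mse_closed_form(eps0, gamma, n_taps, sigma2, n):
    """Closed-form solution (46) at iteration n."""
    rho = contraction_factor(gamma, n_taps)
    steady = gamma * (n_taps + 1) * sigma2 / (2.0 - gamma * (n_taps + 1))
    return rho ** n * eps0 + steady * (1.0 - rho ** n)


def decay_db_per_step(gamma, n_taps):
    """Decay rate of the transient term, in dB per adjustment."""
    return -10.0 * math.log10(contraction_factor(gamma, n_taps))
```

With γ = 1/(N + 1) the contraction factor is 1 - 1/(N + 1), so the exact rate is -10 log10(1 - 1/(N + 1)) dB per adjustment, which approaches 4.34/(N + 1) as N grows.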

VII. SIMULATION 

Computer simulations afforded a comparison of the actual convergence 
behavior of the three arrangements. Results of these simulations 
are shown in Figs. 9 through 15. In each case, the same echo channel 
is used in combination with a 16-tap canceler and γ = 0.01. The echo 
power is normalized to unity, and the sampled echo response is 
truncated after 16 samples, so that perfect cancellation could be 
obtained via a set of proper coefficients. For the nonconvex arrangement 
B, only the decoupled updating algorithm (31) has been used 
since some initial runs without this modification showed convergence 
problems for a variety of parameter choices. 

Fig. 11. Reduction of mean-squared error. Arrangement A, with the initial value 
of the agc equal to 1/α; γ = 0.01, 16 taps. Curves: α = 0.1, w_0 = 10; 
α = 0.316, w_0 = 3.16; α = 1, w_0 = 1. 

Fig. 12. Reduction of mean-squared error. Arrangement B: α = 1, γ = 0.01, 16 taps. 

Fig. 13. Reduction of mean-squared error. Arrangement B: α = 0.316, γ = 0.01, 16 
taps. Horizontal axis: time in symbol intervals. 

Fig. 14. Reduction of mean-squared error. Arrangement B, with the initial value 
of the agc equal to 1/α; γ = 0.01, 16 taps. Curves: α = 0.1, w_0 = 10; 
α = 0.316, w_0 = 3.16; α = 1, w_0 = 1. 

In both arrangements A and B, a uniform, nice exponential convergence 
seems to be the exception rather than the rule. For weak received 
signals, the mean-squared error often initially reduces rapidly up to a 
certain point, after which convergence can become very slow. This 
problem appears to exist even in the case where the agc is preset to 
the reciprocal of the received signal level. One explanation for this behavior 
is that, in those cases where the initial mean-squared error is dominantly 
caused by a misadjustment in only one loop, the overall behavior 
at first approximates that of a system where either w or p is 
the only parameter to be adjusted. This provides a relatively fast 
reduction of the initial error to the point where a joint improvement 
of w and p is required. Once this situation is realized, a much slower 
convergence rate is expected (as we have pointed out in earlier sec- 
tions), in particular with weak received signals. In the latter case, the 
value of w must become large and any inaccuracies in p are magnified. 
Although we do not fully understand the dynamics of these systems, 
we feel it is worthwhile to present these results to point out the 
inherent problems to communication system designers. Further work 
would be required to provide a more thorough insight into these 
systems, but it is our feeling that this may be of more academic interest 
since our analysis has already shown that arrangement C provides as 
attractive a solution to the problem as one could possibly wish. This 
is demonstrated in Fig. 15. Only a single curve is presented, but actual 
simulations have shown that channel attenuation and initial arc value 
can be varied by orders of magnitude and all such resulting curves 
would be essentially identical to the one shown in Fig. 15. The fact 
that simulations tend to deliver more rapid convergence than theoret- 
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Fig. 15— Reduction of mean-squared error. Arrangement C: 16 taps, y = 1/(N + 1). 



ically predicted is, of course, due to the programmed "ideal random- 
ness" of the data sequence used in these simulations, whereas the 
analytic result is an average which includes many sequences that will 
not provide convergence at all (e.g., constant zeros, ones, or a dotting 
pattern). 
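
The insensitivity of arrangement C to channel attenuation can be illustrated with a small noise-free simulation. This is only a sketch under stated assumptions: the echo response is hypothetical, the true far-end symbols stand in for the receiver's decisions (so decision errors are ignored), and the names `simulate_arrangement_c`, `attenuation`, and `w_init` are introduced here for illustration.

```python
import random


def simulate_arrangement_c(attenuation, num_taps=16, steps=500,
                           w_init=1.0, seed=1):
    """Joint LMS update of N canceler taps p and one reference gain w.

    Error: e(n) = r(n) - sum_k p[k]*b(n-k) - w*d(n), where b are local
    symbols and d is the far-end symbol (assumed correctly decided).
    """
    rng = random.Random(seed)
    # Hypothetical echo response scaled to unit power.
    raw = [0.8 ** k for k in range(num_taps)]
    scale = sum(h * h for h in raw) ** 0.5
    echo = [h / scale for h in raw]

    y = 1.0 / (num_taps + 1)          # adaptation coefficient y = 1/(N + 1)
    p = [0.0] * num_taps              # canceler taps start at zero
    w = w_init                        # reference gain, deliberately unmatched
    local = [rng.choice((-1, 1)) for _ in range(num_taps)]
    squared_errors = []
    for _ in range(steps):
        local = [rng.choice((-1, 1))] + local[:-1]
        far = rng.choice((-1, 1))
        received = attenuation * far + sum(h * b for h, b in zip(echo, local))
        e = received - sum(c * b for c, b in zip(p, local)) - w * far
        # Binary symbols: each correlation term is just +/- y*e.
        p = [c + y * e * b for c, b in zip(p, local)]
        w += y * e * far
        squared_errors.append(e * e)
    return squared_errors, p, w


# Attenuations two orders of magnitude apart, same algorithm settings.
errors_strong, _, w_strong = simulate_arrangement_c(attenuation=1.0)
errors_weak, _, w_weak = simulate_arrangement_c(attenuation=0.01)
```

In this setup w settles near the channel attenuation in both runs, which is consistent with the statement that the attenuation can be varied by orders of magnitude without changing the convergence behavior. Replacing the random symbols with a constant or alternating sequence would be one way to explore the non-converging cases mentioned above.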

VIII. DISCUSSION AND SUMMARY 

Three arrangements of joint adaptive echo cancellation and gain 
control for full-duplex data transmission have been examined. Each
corresponds to a different agc location. The third, arrangement C, has 
proven superior in two respects: 

(i) Its convergence rate depends only on the adaptation coefficient 
and on the number of adjustable tap coefficients. On the other hand, 
arrangement A suffers slower convergence as the ratio of echo power 
to distant signal power increases. Arrangement B's echo canceler can,
by proper choice of adaptation coefficient, be made to converge on the
average at a rate independent of channel parameters, but the
convergence of its gain control still depends on the gain of the
forward channel.

(ii) Arrangement C offers simpler digital implementation; the gain 
w multiplies a data symbol, instead of a finely quantized channel 
output or receiver output. Multiplication in arrangement C becomes 
addition or subtraction in the case of binary symbols. 
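
Point (ii) can be made concrete with a short sketch. With binary symbols of value +1 or -1, every product of a coefficient and a symbol reduces to adding or subtracting the coefficient, so no multiplier is needed. The function names below are illustrative only.

```python
import math


def output_with_multiplies(coeffs, symbols):
    """Reference form: sum of coefficient-times-symbol products."""
    return sum(c * s for c, s in zip(coeffs, symbols))


def output_with_add_subtract(coeffs, symbols):
    """Same quantity for symbols in {+1, -1}, using only add and subtract."""
    total = 0.0
    for c, s in zip(coeffs, symbols):
        if s > 0:
            total += c
        else:
            total -= c
    return total


# Example: three canceler taps followed by the gain w applied to a
# far-end data symbol (all values are arbitrary illustrations).
coeffs = [0.5, -0.25, 0.125, 0.3]
symbols = [1, -1, -1, 1]
reference = output_with_multiplies(coeffs, symbols)
cheap = output_with_add_subtract(coeffs, symbols)
```

The two results agree up to floating-point rounding; in a fixed-point digital implementation they would be identical.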

Although the foregoing analyses presuppose binary data symbols, 
each receiver arrangement accommodates multilevel symbols. 

Severe intersymbol interference may necessitate forward equaliza- 
tion. If so, arrangement A with the single coefficient w(n) replaced by 
an adaptive transversal filter is necessary. This case is discussed in 
Ref. 5. The favorable convergence properties and hardware simplicity
of arrangement C may be retained if adaptive decision feedback
equalization with a fixed forward equalizer is used (Refs. 6 and 7).